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Abstract 

The purpose of this study was to investigate the fatty acid profiles in 18 soybean cultivars grown in Korea. 
A total of eleven fatty acids were identified in the sample set, which was comprised of myristic (C14:0), palmitic 
(C16:0), palmitoleic (C16:l, ©7), stearic (C18:0), oleic (C18:l, ©9), linoleic (C18:2, ©6), linolenic (C18:3, ©3), 
arachidic (C20:0), gondoic (C20:l, ©9), behenic (C22:0), and lignoceric (C24:0) acids by gas-liquid chromatog- 
raphy with flame ionization detector (GC-FTD). Based on their color, yellow-, black-, brown-, and green-colored 
cultivars were denoted. Correlation coefficients (r) between the nine major fatty acids identified (two trace fatty 
acids, myristic and palmitoleic, were not included in the study) were generated and revealed an inverse association 
between oleic and linoleic acids (r=-0.94, p<0.05), while stearic acid was positively correlated to arachidic acid 
(r=0.72, p<0.05). Principal component analysis (PCA) of the fatty acid data yielded four significant principal 
components (PCs; i.e., eigenvalues>l), which together account for 81.49% of the total variance in the data set; 
with PCI contributing 28.16% of the total. Eigen analysis of the correlation matrix loadings of the four significant 
PCs revealed that PCI was mainly contributed to by oleic, linoleic, and gondoic acids, PC2 by stearic, linolenic 
and arachidic acids, PC3 by behenic and lignoceric acids, and PC4 by palmitic acid. The score plots generated 
between PC1-PC2 and PC3-PC4 segregated soybean cultivars based on fatty acid composition. 
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INTRODUCTION 

Soybean is one of the major oilseed crops of the world 
by providing an important nutrient source to the world's 
population. Soybeans have received considerable atten- 
tion because their lipid profile is well balanced in the 
types of unsaturated fatty acids (1,2). Well-balanced lip- 
ids in food sources can provide energy to the body, as 
well as provide essential fatty acids, such as linoleic 
acid. However, despite linoleic acids' nutritional and 
functional properties, the presence of two isolated double 
bonds makes it susceptible to oxidation or degradation 
during processing. To improve nutritional attributes and 
oxidative stability of such oils in oilseed crops, plant 
breeders have successfully modified the fatty acid profile 
in oilseed crops, creating new lines with an increased 
content of oleic acid at the expense of linoleic acid (2,3). 
Liu and White (4) denoted the oxidative stability of soy- 
bean oils with altered fatty acid compositions and they 



found linoleic levels correlated highly with the peroxide 
value (M).95, p=0.01). 

Chemometrics is the combination study of mathemat- 
ical, statistical and other logic-based approaches to effi- 
ciently manage and interpret chemically-derived data (5). 
Principal component analysis (PCA) is a popular multi- 
variate modeling and analysis method commonly used 
in chemometric studies. PCA provides a reduced number 
of variables in a large data set and defines a limited 
number of principal components that describe independ- 
ent variation structures in the data. When more than three 
variables have been measured, visualization of the data 
by various plotting systems is then possible (6). There- 
fore, PCA shows relationships among groups of varia- 
bles in a data set and shows relationships that exist be- 
tween objects. 

PCA has been used effectively to define relationships 
that exist in fatty acid characterization studies of food 
lipids due to its ability to manage and interpret large 



Corresponding author. E-mail: kmcho@gntech.ac.kr 
Phone: +82-55-751-3272, Fax: +82-55-751-3279 



Fatty Acid Profiles in Soybean Cultivars 



185 



data sets (7-9). For example, almond cultivars have been 
classified in several studies using chemometric techni- 
ques (10). Garcia-Lopez et al. (10) used cluster analysis 
of the major fatty acid composition in almond lipids to 
classify the cultivars into three groups. In a later study, 
the three groups of almond cultivars were further sub- 
divided by including minor fatty acids in the multivariate 
analysis (11). In recent study, Sathe et al. (12) applied 
cluster analysis to show that fatty acid composition of 
California grown almonds is influenced by cultivar, loca- 
tion of growth and crop year. 

Although soybean oil has been included in some che- 
mometric studies comparing vegetable oils, soybean cul- 
tivars have yet to be extensively classified using multi- 
variate techniques (13-15). This study proposes and de- 
scribes an analytical method to detect, identify and quan- 
tify fatty acid profiles in soybeans, applying GC with 
FID detection associated with PCA, a powerful chemo- 
metric approach. We conducted a study which re-exam- 
ined and updated compositional information of soybeans, 
including the levels of key bio-actives, grown in Korea. 
To ensure the most accurate data, an intensive sampling 
program was designed and implemented by the industry 
to provide cultivars in current production by accepted 
agricultural practices employed in Korea. Findings of the 
research will be used for the nutrient database of various 
soybean cultivars. This paper reports on the employment 
of PCA, a multivariate statistical method, to discriminate 
and classify field-grown soybean cultivars based upon 
their fatty acid profiles. Although soybean lipids have 
been included in some chemometric studies comparing 
vegetable oil fatty acid compositions, this paper provides 
more and advanced information concerning the applica- 
tion of PCA to the fatty acid profiles of a distinct sample 
set of only the most predominant commercial soybeans. 
Utilizing PCA will effectively reduce the number of vari- 
ables needed to classify soybean cultivars, and, in this 
manner, will permit soybean researchers (e.g., breeders, 
geneticists) to more easily develop significant relation- 
ships between important soybean characteristics. 

MATERIALS AND METHODS 
Materials 

Soybean samples (n=18), comprising 9 yellow (Sae- 
danbaek, Daewon, Daepung, Neulchan, Taekwang, Sunyu, 
Whanggeum, Daemang, & PungsannamuT), 6 black 
(Ilpumblack, Black #5, Chungja, Chungja #2, Chungja 
#3, & Dawon), 2 brown (Galmi & Galchae), and 1 
green, (Nokchae) were harvested in 2011 and were pro- 
vided by the National Institute of Crop Science (NICS) 
of the Rural Development Administration (RDA) in 



Miryang, Korea. After their arrival at Gyeongnam Nation- 
al University of Science and Technology, the soybeans 
were packaged in labeled vacuum pouches to prevent 
their degradation. The vacuum-packaged soybean sam- 
ples were stored at -40°C until analyzed. 

Chemicals 

HPLC-grade methanol (CH 3 OH), chloroform (CHC1 3 ), 
isooctane and ACS-grade boron trifluoride in methanol 
(BF 3 ), anhydrous sodium sulfate, and sodium chloride 
were purchased from Fisher Scientific Company LLC 
(Suwanee, GA, USA). Undecanoic acid (98% purity), 
and a variety of lipid standard mixture (37 FAMEs) were 
acquired from Sigma Chemical Company (St. Louis, MO, 
USA). 

Lipid extraction 

Raw soybeans were removed from the freezer and al- 
lowed to reach room temperature. Total lipids were ex- 
tracted according to the classical Bligh-Dyer method 
(16) with slight modifications. For each sample, 20 g 
of soybeans were finely ground in a commercial coffee 
mill. Five grams of the ground sample were accurately 
weighed and transferred to a 250-mL Erlenmeyer flask. 
Twenty milliliters of deionized FLO, 50 mL of CH3OH, 
25 mL of CHCI3, and ~10 mg of hydroquinone (as anti- 
oxidant) were added, and the contents were blended at 
5,400 rpm for 2 min using a Polytron®-type homogenizer 
(Pro Scientific INC, Monroe, CT, USA). After the initial 
blending, an additional 25 mL of CHCI3 were added, 
and homogenization was repeated for 1 min. Next, 25 
mL of deionized H 2 0 and 35 mL of CHCI3 were added; 
the mixture was again blended for 1 min. The slurry was 
filtered through Whatman No. 1 filter paper (GE Health- 
care Life Sci., Piscataway, NJ, USA) using a Biichner 
funnel under slight vacuum. Approximately 1 g of NaCl 
crystals was added to the filtrate to facilitate phase 
separation. The filtrate was quantitatively transferred to 
a 250-mL separatory funnel and allowed to stand over- 
night to permit complete separation of the layers. The 
CHCI3 phase was passed through Whatman No. 1 filter 
paper containing anhydrous Na2S04 to eliminate moisture 
and collected in a 100-mL round bottom flask. The CHCI3 
was removed under vacuum at <40°C using a Rotavapor/ 
Heating Bath (Model R-210 and B-491, respectively; 
Biichi Corporation, New Castle, DE, USA). The re- 
sultant lipid was transferred via a Pasteur pipette to an 
amber-colored vial. The round bottom flask was rinsed 
with a small portion of CHCI3 to ensure a quantitative 
transfer of the extracted lipids. Residual CHCI3 was re- 
moved from the vial using a stream of nitrogen. Samples 
were stored under a N 2 -headspace at -80°C until further 
analyzed. 
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Fatty acid methylation 

The extracted soybean lipids were used for fatty acid 
analysis. Fatty acid methyl esters (FAMEs) were pre- 
pared according to the Ngeh-Nawainbi's method with 
slight modifications (17). Briefly, extracted lipids (~25 
mg) were transferred to a Reacti-vial™ small reaction 
vial (5 mL, Thermo Fisher Scientific, Rockford, IL, USA), 
and the mass was accurately weighed. Undecanoic acid 
was employed as the internal standard (IS) for this work. 
One milliliter of IS (i.e., undecanoic acid 1 mg/mL in 
isooctane) was added to each Reacti-vial™. Soybean lip- 
ids were mixed with 0.5 N NaOH in methanol and flush- 
ed with a stream of nitrogen gas. The mixtures were 
heated at 100°C for 5 min. After cooling to room temper- 
ature, 2 mL of the 14% boron trifluoride in methanol 
(BF 3 /MeOH), along with a Reacti-vial™ magnetic stir- 
rer, were added to each vial, which was tightly capped, 
vortexed for 1 min, and placed in Reacti-Block™ B-l 
aluminum block within a Reacti-Therm III™ Heating/ 
Stirring Module (Thermo Fisher Scientific, Rockford, IL, 
USA) at 100°C for 30 min. After derivatization, samples 
were removed and allowed to cool to room temperature. 
Next, 5 mL of saturated NaCl solution was added to 
each reaction vial; the solution was vortexed for 30 s 
and fatty acid methyl esters (FAMEs) were extracted 
with 1.5 mL of isooctane. The isooctane layers were 
combined in a test tube. The isooctane layer was trans- 
ferred to 2-mL wide-opening crimp top vials. Vials were 
capped with 11 -mm silver aluminum caps, clear PTFE/ 
red rubber septa, and then crimped with a crimper. 

Gas chromatographic analysis 

An Agilent Technologies 6890N Network gas chroma- 
tograph system with FID detector (Agilent Technologeis 
Inc., Wilmington, DE, USA) was employed for fatty acid 
analysis profiling. Operating conditions were as follows: 
the column was a SP-2560 capillary column (100 mx0.25 
mm i.d., 0.25-um film thickness; Sigma-Aldrich Co., St. 
Louis, MO, USA); ultra-high purity nitrogen was the 
carrier gas at a flow rate of 1 mL/min and analyses were 
performed in constant flow mode; a split liner with glass 
wool was installed in the injector; the injector temper- 
ature was set at 220°C for split injection at a split ratio 
of 10:1; the FID temperature was set at 240°C; ultra-high 
purity hydrogen and scientific-grade air were the fuel 
gases for the FID and set at 40 mL/min and 450 mL/min, 
respectively; the initial oven temperature was set at 
140°C and held for 5 min before ramping up at 4°C/min 
to 230°C; this temperature was maintained for an addi- 
tional 35 min. Analyses were preformed in triplicate. 

Identification of fatty acids 

A Supelco 37 FAME reference standard was used to 



identify and quantity individual FAMEs from soybean 
samples. A relative response factor was calculated for 
each FAME using methyl undecanoate as an IS. Each 
FAME has a different response to the FID, depending 
on chain length, saturation, and cis/trans configuration 
(17): 

R ! = (PS,XWSC11:0) / (PScil:0XWS is ) 

where R, = relative response factor for fatty acid i; Ps, 
= peak area of individual FAME i in FAMEs standard 
solution; Wscii:o = mg of CI 1:0 FAME in injected 
FAMEs standard solution; Pscn : o = peak area of CI 1:0 
FAME in FAMEs standard solution; and Ws ls = mg of 
individual FAMEs i in injected FAMEs standard solution. 

Oil characteristics 

The ratio of oleic to linoleic (O/L) acid, iodine value 
(IV), the ratio of unsaturated to saturated fatty acids 
(U/S), and the percentage of saturation (% saturation) 
were calculated from GC fatty acid determinations ac- 
cording to the following formulae (18): 
O/L ratio = % oleic acid / % linoleic acid 
IV = (0.8601 x% oleic acid) + (1.7321 x% linoleic 
acid) + (2.616 x% linolenic acid) + (0.7854 x% 
gondoic acid) 
U/S = (% oleic + linoleic + linolenic + gondoic acids) / 
(% palmitic + stearic + arachidic + behenic + lig- 
noceric acids) 
% Saturation = (% palmitic + stearic + arachidic + be- 
henic + lignoceric acids) / (% palmitic + stearic + 
oleic + linoleic + linolenic + arachidic + gondoic + 
behenic + lignoceric acids) 

Principal component analysis (PCA) 

PCA was used for the classification and discrimination 
of soybeans. PCA is a multi-variate statistical method 
that entails data reconstruction and reduction. PCA gen- 
erates a set of new orthogonal axes or variables known 
as principal components (PCs), which are different from, 
but representative of, the original variables. The data sets 
presented on the orthogonal axes are uncorrected with 
one another, and express much of the total variability 
in the data set through comparison of only a few PCs 
(19). The maximal amount of variance in the data set 
and its direction are often explained by the first PC (i.e., 
PCI). Each PC is defined by a vector known as the ei- 
genvector of the variance-covariance matrix. The var- 
iance along the vector is known as the eigenvalue. Each 
eigenvalue, the amount of variance that is explained by 
a given component, was used for the determination of 
variances of the major PCs. The loadings (or scores) cor- 
responding to the PCs were calculated from the correla- 
tion matrix (20). The arithmetic value of each PC is de- 
termined by followed equation (21): 
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where x\ are measurements of the original variables, *i 
are mean values for the corresponding variables, SD\ are 
standard deviations for the corresponding variables, and 
ai are loadings of the linear transformation. 

Each loading of variables was used for the con- 
tribution of the original variable to the PC. Variance re- 
duction was achieved by neglecting the unimportant di- 
rections in which samples' variance are insignificant. 
The important variables are along several significant di- 
rections, and the number of these directions approx- 
imates the dimensionality of the sample set. For a visual- 
ization of the data discrimination, PCA plots mapped 
variables and samples through dimensional spaces de- 
termined by PCs with eigenvalues greater than 1.0, based 
on Kaiser's rule (22). The plots depicted the identi- 
fication and correlation of variables and objectives. 
Analysis of fatty acid composition in soybean samples 
was performed in triplicate. Results are expressed as the 
mean ± standard deviation (SD). 

RESULTS AND DISCUSSION 
Fatty acid profiles 

A representative GC chromatogram of the FAMEs 
from soybean lipid extracts is depicted in Fig. 1. A total 
of 11 fatty acids containing 2 trace (>0.1% weight) fatty 
acids were identified in this study by retention time map- 
ping with external standards and quantified relative to 
an internal standard (i.e., methyl undecanoate, the FAME 
resulting from the derivatization of CI 1:0). Response 
factors of individual fatty acids were shown in Table 
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Fig. 1. Representative GC chromatogram of fatty acid methyl 
esters (FAMEs) from the lipid extract of soybean cultivars 
(peak 1, myristic acid (C14:0); 2, palmitic acid (C16:0); 3, 
palmitoleic acid (CI 6:1, ©7); 4, stearic acid (C18:0); 5, oleic 
acid (C18:l, ©9); 6, linoleic acid (C18:2, ©6); 7, arachidic acid 
(C20:0); 8, gondoic acid (C20:l, ©9); 9, linolenic acid (C18:3, 
©3); 10, behenic acid (C22:0); and 11, lignoceric acid (C24:0). 



Table 1. Response factor of individual fatty acids 



Fatty acid 


Response factor 


ri4-o 


1.10 


C16:0 


1.28 


C16:l ©7 


1.20 


C18:0 


1.36 


C18:l ©9 


1.36 


C18:2 ©6 


1.35 


C18:3 ©3 


1.42 


C20:0 


1.35 


C20:l ©9 


1.39 


C22:0 


1.46 


C24:0 


1.46 



1 for the method validation. Table 2 reports the fatty 
acid compositions of lipid extracts from 18 soybean 
cultivars. Palmitic acid (CI 6:0) ranged from 9.18% to 
13.02%; stearic acid (C18:0), 2.80% to 4.74%; oleic acid 
(C18:l, ©9), 16.52% to 33.80%; linoleic acid (C18:2, 
0)6), 44.72% to 60.34%; linolenic acid (CI 8:3, ©3), 5.62 
% to 9.54%; arachidic acid (C20:0), 0.26% to 0.48%; 
gondoic acid (C20:l, m9), 0.22% to 0.41%; behenic acid 
(C22:0), 0.27% to 0.59%; and lignoceric acid (C24:0), 
0.00% to 0.39%. These results correspond with those 
from other studies and show that the sum of oleic and 
linoleic acids accounts for almost 80% of the total fatty 
acids detected in soybean samples (1,3,23). The oleic 
to linoleic acid (O/L) ratio is a quality index employed 
for the determination of genetic soybean characteristics 
and it ranges from 0.29 to 0.82. In other lipid character- 
istics, IV, a degree of unsaturation, ranged from 123.71 
to 139.78; U/S, 4.60 to 5.91; % saturation, 14.39 to 17.69. 
Pearson correlation coefficients between fatty acid varia- 
bles are given in Table 3. The correlation between oleic 
acid and linoleic acid was strong but negative (r= -0.94, 
p<0.05); that is, an increase in one fatty acid leads to 
a corresponding decrease in the other. This negative rela- 
tionship originates from the biochemical pathways of 
soybean development: in the soybean germplasm, palmi- 
toyl CoA is elongated to stearoyl CoA followed by desa- 
turation forming oleic acid. Then, the action of oleoyl- 

12 

phosphatidylcholine desaturase (a A -fatty acid desatur- 
ase) synthesizes linoleic acid from oleic acid (24). In 
addition, stearic acid is positively correlated to arachidic 
acid (r=0.72, p<0.05), while oleic acid is negatively 
correlated to linolenic acid (r=-0.50, p<0.05). 

Principal component of analysis (PCA) 

The data matrix of the predominant 9 fatty acids in 
the soybean samples analyzed was subjected to PCA to 
decrease the number of descriptors associated with the 
data set, while still explaining the maximum amount of 
variability present in the data. Table 4 shows the most 
significant PCs generated from the soybean fatty acid 
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Table 3. Pearson correlation coefficients between percentage levels of nine fatty acids from lipid extracts of soybeans grown 
in Korea (w=18) 
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cd9 
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Symbol corresponds the significant values at the level of significance a=0.05 (two-tailed test). 



Table 4. Eigen analysis of the correlation matrix loadings of 
the significant principal components (PCs) 



Fatty acid 1 ' 
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Eigenvalue 
Variance (%) 
Cumulative (%) 


2.53 
28.16 
28.16 


2.07 
23.04 
51.20 


1.54 
17.15 

68.35 


1.18 
13.14 
81.49 



''Nine fatty acid from extracts of soybeans grown in Korea. 
2 '+, positive correlation; -, negative correlation. 



data and their statistical loadings in the current study. 
A new set of nine orthogonal variables (PCs) was gen- 
erated by PCA. The first principal component (i.e., PCI) 
had the highest eigenvalue of 2.53, and accounted for 
28.16% of the variability in the data set. The second, 
third, and fourth PCs (PC2, PC3, and PC4) had eigenval- 
ues of 2.07, 1.54, and 1.18, and accounted for 23.04%, 
17.15% and 13.14% of the variance in the data, respec- 
tively (Note: only eigenvalues of >1.0 are considered 
significant descriptors of data variance, according to 
Kaiser's rule). The remaining five generated PCs (PC5 
to PC9) yielded progressively smaller eigenvalues and 
did not explain significant variability in the data (<19% 
total). Therefore, according to Kaiser's rule, only the 
first four PCs were used for further study. These numbers 
represent significant contributions of individual fatty 
acid variables to the total variability explained by the 
generated PCs. PCI describes 28.16% of the variance 
in the data set, and its loadings indicate that it has high 
contributions from oleic (0.53), linoleic (-0.50), and gon- 
doic (-0.42) acid variables. PC2 showed positive load- 
ings for stearic (0.53) and linolenic (0.42) acids and a 
negative loading for arachidic acid (0.42) and PC3 was 



most described by behenic (0.57) and lignoceric (0.60) 
acids. PC4 shows a positive loading for palmitic acid 
(0.59). Specific patterns of correlation between the varia- 
bles tested can be visualized by comparing loading plots 
between the PCs, as shown in the four planes (I, II, III, 
and IV) on the PC1-PC2 and PC3-PC4 axes in Fig. 2A 
and 2B. The objective of a loading projection is to visu- 
alize the position of the variables with respect to one 
another in two-dimensional space and their correspond- 
ing correlations. Variables closest to one another and far 
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Fig. 2. Loading plots of PC1-PC2 and PC3-PC4 for fatty acid 
profiles in soybeans. (A) PC1-PC2 and (B) PC3-PC4. 
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from the plot origin are positively correlated, while vari- 
ables opposite one another on the plot are negatively 
correlated. The two loading plots generated from the data 
of Table 4 can explain the relationships between two 
variables by their angle from the center. The correlation 
coefficient between two variables is defined as the cosine 
of the angle between their respective vectors on the plot. 
The cosine of 180° (i.e., the angle between oleic acid 
and linoleic acid on the PC1-PC2 plot) is 1; therefore, 
they are negatively correlated. Based on this mathemat- 
ical rule, uncorrelated variables occur at right angles to 
one another because the cosine of the angle between 
them is cosine 90° = 0, or not correlated. Similarly, the 
cosine of 0° is 1, which denotes a positive correlation 
between the variables (25). Fig. 3 (A and B) depicts 
the score plots of fatty acids in soybean cultivars («=18) 
generated from comparing PC1-PC2 and PC3-PC4, and 
revealed four district sample groupings in the four planes 
(I, II, III, and IV) on the PCI PC2 and PC3-PC4 axes. 
For Fig. 3(A), plane I, with the vectors of stearic and 
linolenic acids showing a positive correlation (see Fig. 
2A-I), contains samples from the Ilpumblack, Galchae 
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Fig. 3. Score plots of PC1-PC2 and PC3-PC4 for fatty acid 
profiles in soybeans. (A) PC1-PC2 and (B) PC3-PC4. Cultivar 
symbols: (•) Yellow; (■) Black; (a) Brown; (♦) Green. 



cultivars. Plane II is especially descriptive of a negative 
contribution of PCI and a positive contribution of PC2. 
It predominately contains Neulchan, Daepung, and 
Chungja #3 cultivars, which possess high linoleic acid 
content, but lower oleic acid content relative to other 
cultivars in the sample set (Fig. 2A-II and Fig. 3A-II). 
These relationships are apparent in Table 4, which de- 
notes the negative vector loadings of arachidic acid to 
PC2, and the positive vector loadings of stearic acid to 
PC2. Plane III is especially described by a negative con- 
tribution from PCI and PC2. This plane comprises most- 
ly Galmi, Whanggeum, Chungja #2, and Punsannamul 
samples. Observations of plane III reveal the lower quan- 
tities of oleic acid compared to other samples in the sam- 
ple set (Fig. 2A-III and Fig. 3A-III). Plane IV is indicative 
of a positive contribution of PCI and a negative con- 
tribution of PC2. It contains mostly Black #5, Saedan- 
baek, and Taekwang cultivars (Fig. 2A-IV and Fig. 
3A-IV). Observations of plane IV show that it contains 
higher levels of oleic and behenic acids content relative 
to the other samples in the sample set. For Fig. 3(B), 
plane I demonstrates higher levels of palmitic acid and 
is comprised of Chungja #3, Saedanbaek, Whanggeum, 
and Nokchae samples (Fig. 2B-I and Fig. 3B-I); plane 
II is observed to contain lower levels of lignoceric acid 
and is comprised of Daemang, Taekwang, Black #5 and 
Pungsannamul cultivars (Fig. 2B-II and Fig. 3B-II). 
Plane III reveals higher levels of oleic acid in the culti- 
vars Neulchan, Chungja, Daewon, and Daepung (Fig. 
2B-III and Fig. 3B-III). Lastly, analysis of plane IV in- 
dicates higher levels of lignoceric acid in the Chungja 
#2, Ilpumblack, and Galchae samples (Fig. 2B-IV and 
Fig. 3B-IV). Mohamed and Rangappa (23) analyzed fat- 
ty acid profiles in seventeen soybeans and noted that the 
content of oleic acid was inversely related to the concen- 
trations of linoleic (/— 0.82) and linolenic (>=-0.66) acids. 
Other investigations reported fatty acid profiles in soy- 
bean paste during fermentation using PCA as a tool, 
however, they didn't classify soybeans by cultivars (26). 

This research specifically focused on Korean field- 
grown soybean cultivars and showed how the contribu- 
tions of individual fatty acids related to the generated 
PCs; individual fatty acid contributions to the total varia- 
bility of a PC are often omitted from other investigations, 
but are invaluable (23). Eigen analysis of the correlation 
matrix loadings of the four significant PCs revealed that 
PCI was mainly contributed to by oleic, linoleic, and 
gondoic acids, PC2 by stearic, linolenic, and arachdic 
acids, PC3 by behenic and lignoceric acids and PC4 by 
palmitic acid. When the loading and score plots for soy- 
bean cultivars were projected as PC1-PC2 and PC3-PC4 
groupings, there was an evident reduction in the number 
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of variables necessary for the discrimination of soybean 
cultivars. PCs 1 through 4 together were found to be 
explanatory of more than 81.49% of the total variability 
in the data set. Statistical examinations need to be con- 
ducted routinely with data generated by the most up-to- 
date scientific methods and technologies to maintain the 
highest degree of validity. Furthermore, this study clearly 
indicates that the combination of experimental GC fatty 
acid data along with a chemometric approach (PCA, in 
this case) can be successfully employed by soybean re- 
searchers to give more information on variation in soy- 
bean cultivars than is capable with the experimental data 
alone. 
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